Provenance Tracking in R
Abstract
The CXXR project aims gradually to reengineer the fundamental parts of the R interpreter from C into C++ in such a way that:
• the full functionality of the standard distribution of R (including the recommended packages) is preserved;
• the behaviour of R code is unaffected (unless it probes into interpreter internals);
• there is no change to the existing interfaces for calling out from R to other languages such as C or Fortran, nor to the main APIs for calling into R.
CXXR achieves a high degree of compatibility with R packages from the CRAN repository: see [1].
Similar resources
Provenance-Awareness in R
It is generally acknowledged that when, in 1988, John Chambers and Richard Becker incorporated the S AUDIT facility into their S statistical programming language and environment, they created one of the first provenance-aware applications. Since then, S has been spiritually succeeded by the open-source R project; however, R has no such facility for tracking provenance. This paper looks at how pro...
A Query Language of Data Provenance Based on Dependency View for Process Analysis
As the scale of data in processes keeps increasing, data provenance also becomes large and grows constantly, which challenges the efficiency of provenance tracking in process analysis. This paper proposes a dependency view to extract a global data provenance description of the data process instance, and then defines a contextual query language based on the dependency view to imple...
A Distributed Provenance Aware Storage System
The provenance of a file represents the origin and history of the file data. A Distributed Provenance Aware Storage System (DPASS) tracks the provenance of files in a distributed file system. The provenance information can be used to identify potential dependencies between files in a filesystem. Some applications of provenance tracking include (i) tracking the transformations applied to process...
Provenance Issues in Platform-as-a-Service Model of Cloud Computing
In this paper we present provenance issues that arise in building the Platform-as-a-Service (PaaS) model of cloud computing. The issues relate to designing, building, and deploying the platform itself, and to building and deploying applications on the platform. These include tracking of commands for successful software installations, tracking of inter-service dependencies, tr...
Tracking Emigrant Data via Transient Provenance
Information leaks are a constant worry for companies and government organizations. After a leak occurs, it is very important for the data owner to determine not only the extent of the leak but also who originally leaked the information. We propose a technique to extend data provenance to aid in determining potential sources of information leaks. While data provenance is commonly defined as the ances...
Oceanographic Data Provenance Tracking with the Shore Side Data System
The importance of tracking the provenance of electronic data becomes apparent when data set providers need to also provide metadata describing where the data came from. This need has driven the development of a practical oceanographic data provenance system at the Monterey Bay Aquarium Research Institute. MBARI’s Shore Side Data System is designed to manage data collected, processed, and archiv...